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Inspiraling binaries of compact objects are primary targets for current and future gravitational- 
wave observatories. Waveforms computed in General Relativity are used to search for these sources, 
and will probably be used to extract source parameters from detected signals. However, if a different 
theory of gravity happens to be correct in the strong-field regime, source-parameter estimation may 
be affected by a fundamental bias: that is, by systematic errors induced due to the use of waveforms 
derived in the incorrect theory. If the deviations from General Relativity are not large enough 
to be detectable on their own and yet these systematic errors remain significant (i.e., larger than 
the statistical uncertainties in parameter estimation), fundamental bias cannot be corrected in a 
single observation, and becomes stealth bias. In this article we develop a scheme to determine in 
which cases stealth bias could be present in gravitational-wave astronomy. For a given observation, 
the answer depends on the detection signal-to-noise ratio and on the strength of the modified- 
gravity correction. As an example, we study three representative stellar-mass binary systems that 
will be detectable with second-generation ground-based observatories. We find that significant 
systematic bias can occur whether or not modified gravity can be positively detected, for correction 
strengths that are not currently excluded by any other experiment. Thus, stealth bias may be a 
generic feature of gravitational-wave detections, and it should be considered and characterized, using 
expanded models such as the parametrized post-Einstein framework, when interpreting the results 
of parameter-estimation analyses. 

PACS numbers: 04.80.Cc, 04.80.Nn, 04.30.-w, 04.50. Kd 



I. INTRODUCTION 

Although General Relativity (GR) has been shown to 
be an excellent description of Nature in all astronomical, 
astrophysical, and laboratory observations carried out so 
far jT|, this theory remains largely untested in the non- 
linear and dynamical strong-field regime, where gravita- 
tional fields are large and rapidly evolving, and velocities 
are not small. Future gravitational-wave (GW) obser- 
vations of the late inspiral and coalescence of compact 
binaries will probe this regime and allow for new tests of 
GR. To do so, however, the assumption that GR is cor- 
rect must be relaxed in GW science, and data analysis 
must be carried out with more general waveform families 
than those predicted by GR. 

The parameterized post-Einsteinian (ppE) frame- 
work [2] was recently proposed as a generic solution 
for this need. In this framework, one deforms the GR 
waveforms through amplitude {A) and phase (\I>) correc- 
tions expanded in a polynomial basis: (5^* = f3iu''^ and 
6 A = aiu"-', where u — irMf is the reduced GW fre- 
quency and {ai, /3i,ai,bi) are ppE parameters. Of these, 
ai and bi control the type of modified-gravity (MG) cor- 
rections, while ai and Pi control their magnitude. Recent 
data-analysis investigations of this framework, both in its 
full form |3j and in a reduced version restricting the al- 
lowed hi to those that appear in post-Newtonian (PN) 
expansions [3], suggest that it could be used in realistic 
settings. 

The simplest ppE model includes a single ampli- 
tude and phase correction to the waveform, and it 



is still sufficiently general to extend the PN-coefficient 
testfQ of Refs. [SHZ] to all currently studied MG the- 
ories, including Brans-Dicke theory [HUlSj. dynami- 
cal Chern-Simons gravity |17H21j . phenomenological 
massive-graviton propagation [SI [101114, 22.-.24j. gravita- 
tional Lorentz violation [5S], gravitational parity viola- 
tion [191 US] , violations of Local Position Invariance [27] , 
and the existence of extra-dimensions |28j . More com- 
plicated ppE models posit a piecewise-specified hybrid 
waveform model that includes the merger and ringdown 
phases, as well as multi-exponent inspiral corrections [2]. 

Neglecting the prospect that GR might be an inaccu- 
rate description of Nature in the strong-field regime may 
lead to the mischaracterization of the GW Universe once 
detections are made. This pitfall has been labeled fun- 
damental bias [2] in analogy to other types of bias in 
observational astronomy. Imagine, for example, that the 
behavior of gravitation is correctly described by a MG 
theory that allows for scalar radiation only in the very 
late stages of inspiral. If so, the inspiral will proceed 
faster than would be expected in GR, and less power 
may be emitted in GWs during ringdown. Such events 
might still be detected with GR templates, albeit with 
suboptimal efficiency, but they will appear closer than 
they really are, introducing a systematic error in the es- 
timated luminosity distance. 



These tests treat the coefficients in the PN expansion of the 
inspiral-waveform phasing as independent parameters rather 
than fixed functions of the binary parameters, and verify the 
consistency of their measured values. 
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This paper discusses how we may quantify the degree 
to which fundamental bias could lead us to incorrect in- 
ferences about the parameters of detected GW sources. 
The magnitude of the errors introduced will of course de- 
pend on the magnitude of the MG correction. In this first 
study, we will model the correction with the simplest ppE 
inspiral waveform, which includes a single-exponent mod- 
ification to the phasing. We ask the following question: 
As a function of correction exponent b and magnitude 
/3, what is the detection signal-to-noise ratio such that 
the systematic error due to using GR templates is equal 
to the statistical, noise-induced error in the determina- 
tion of the source parameters, such as the chirp mass 
and mass ratio? We then ask: At that SNR, is there 
sufficient statistical evidence, as quantified by Bayesian 
model comparison ^29j, to claim that the MG model is 
preferred? If the systematic error is relatively large, but 
model comparison still prefers GR, we would found our- 
selves in a situation of stealth bias [3] that makes it log- 
ically impossible to correct the systematic effects of MG 
theories. 

Fundamental bias in GW observations is similar to 
the mismodeling bias considered by Cutler and Vallisneri 
(who call it theoretical error [30]): the idea is that the 
approximations used to solve the Einstein equations and 
construct waveform templates may introduce parameter- 
estimation errors that are larger than statistical uncer- 
tainties. Cutler and Vallisneri developed simple formulas 
to estimate the likely mismodeling error; in this paper we 
use them in the ppE context. We also use Vallisneri's re- 
cent formula |31j to compute the Bayesian odds ratio, as 
measured in GW observations, for GR versus a MG the- 
ory linked to GR by one or more continuous parameters. 
These formulas are valid for sufficiently large detection 
SNRs. For weaker signals, a full-scale Monte Carlo ap- 
proach is needed |3]. In this paper, we limit ourselves 
to exemplifying the logic of our approach with simpler 
analytic tools of more limited applicability, and we leave 
a detailed Monte Carlo analysis for future work. 

As an example, we concentrate on the signals from 
the circular-adiabatic inspiral of nonspinning stellar- 
mass binaries, as detected by a network of second- 
generation ground-based observatories. The signals in- 
clude ppE phasing corrections with a range of expo- 
nents and strengths. We find that, for likely detection 
SNRs, three cases are indeed possible, depending on the 
strength of the correction: 

i. Fundamental bias is significant, but statistical ev- 
idence is sufficient to positively detect MG effects, 
so the bias can be corrected; 

ii. Fundamental bias is significant, but statistical evi- 
dence is insufficient to detect MG effects, so stealth 
bias is present; 

iii. Fundamental bias is not significant and MG effects 
cannot be detected. 



For exponents b corresponding to quadrupole- or higher- 
order FN terms (i.e., b > —5/3), stealth bias can occur 
for correction strengths /3 that are not currently excluded 
by other constraints, such as those from binary pulsars. 

This paper is organized as follows. Section [ll] describes 
our formulation; Sec. |III| discusses our analysis and re- 
sults; Sec. provides are conclusions. Throughout this 
paper we use the conventions of Misner, Thorne, and 
Wheeler [32] , such as geometric units with G = c = 1 . 



II. FORIVIULATION 

In this section we describe our analytical formula- 
tion. Subsection |II A| introduces ppE waveforms for in- 
spiraling binaries; Subsecs. |IIB| and |II C| provide details 
about the estimation of fundamental bias and the model- 



comparison detection of MG; Subsec. II D ties all of these 
together to characterize the impact of fundamental bias 
in typical detections and the occurrence of stealth bias. 



A. Systematic bias and the ppE framework 

GW searches performed with matched filtering can 
be affected by systematic bias if the theoretical sig- 
nal templates used in the search do not match exactly 
the true signals that appear in the data. Symbolically, 
/itruo = /ithoory + Sk, where the unmodeled correction Sh 
can have different origins [33] : 

• Mismodeling: caused by the approximations used 
in solving the equations that describe the motion 
of the GW source, and the generation and propa- 
gation of GWs [30]. 

• Instrumental: caused by the approximations 
used in modefing the response of the detector to 
GWs [51[35]Q 

• Astrophysical: caused by unmodeled astro- 
physics (e.g., neutron-star hydrodynamics in NS- 
NS inspirals) or objects (e.g., a third star in the 
close vicinity of a binary) [33l |36l [37] . 

• Fundamental: caused by MG corrections to Ein- 
stein's equations [^[31I3T]. 



^ Of course, the first thing that comes to mind when consider- 
ing instrumental effects is the parameter-estimation error due to 
detector noise, which has a stochastic rather than systematic na- 
ture. These errors may be difficult to characterize statistically 
if the noise is significantly non-Gaussian or nonstationary, but 
they are distinct from the systematic effects that we address in 
this paper, since they can be alleviated only by improving the 
instrument itself, rather than its theoretical description or the 
modeling of sources. 
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Mismodeling bias is remedied by deriving ever more 
accurate solutions to the field equations: for binary in- 
spirals, this goal is currently pursued by pushing the 
PN approximation to higher orders, and by integrating 
together PN and numerical-relativity results with ana- 
lytical resummation and fitting techniques, such as the 
effective-one-body scheme. Instrumental bias is reduced 
by careful detector modeling and characterization. As- 
trophysical bias is expected to be irrelevant for most 
ground-based binary observations; but even if this were 
not the case, astrophysical effects should present them- 
selves differently (or not at all) in observations of differ- 
ent systems, whereas fundamental bias, if present, would 
appear equally in all observed systems. 

In this paper, we concentrate on the inspiral signals 
from compact-binary coalescences. We consider inspirals 
that are circular and adiabatic, with negligible spin ef- 
fects, and neglect mismodeling bias by assuming the GW 
emission is well-described by the restricted PN waveform 
in the frequency-domain, stationary-phase approxima- 
tion [551 - HT] . In GR, the resulting signals can be written 
as 



(1) 



where Agr(/) = Au~''^^ [1 + ' ' ' ] neglect PN ampli- 
tude corrections, symbolized with ellipses in the above 
equation), u — nJHf is the reduced frequency, A4 = 
rf/^M the chirp mass, rj = toiTO2/M^ the symmetric 
mass ratio, M = mi -I- m2 the total mass, and / the GW 
frequency. The constant amplitude A depends on the 
chirp mass, the luminosity distance, and the detector's 
antenna patterns [55H1T] . The quantity 5'gr(/) in Eq. 
([T]) is the GW phase, given in the PN approximation by 
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k=2 



^-fe/5^/c/3 



(2) 



where the constant coefficients {ipk,'4'k) can be found (for 
instance) in Ref. [30] . 

Under these assumptions, the unmodeled corrections 
enumerated above can be represented by a continuous 
and (in principle) predictable deformation of the GW 
phase VE'(/) and amplitude A{f). The particular defor- 
mation depends on the systematic effect. For mismod- 
eling, we expect corrections within the structure of the 
PN series {6 A oc /-7/6+fe/i/3 j^j. ^.j^g amplitude and S'if oc 
j-5/3+fc^/3 £qj. ^Yie phase, with integers kA,k^ > 0), be- 
yond the highest known perturbative order (i.e., fc^ > 5 
for the amplitude and > 7 for the phase). 

For astrophysical effects, we expect corrections to arise 
almost always with "negative" PN exponents [331 ISZl 
132] ■ For example, an accretion disk the presence of a 
third body [3S|, and orbital eccentricity [32] all introduce 
GW phase corrections 6'^ oc f^^/^~^<t> , with integer k'^ > 



0. Physically, this frequency-dependence corresponds to 
astrophysical effects becoming less important for tighter 
binaries, where strong-field effects become dominant. 

Moving on to unmodeled corrections originating from 
fundamental physics, the amplitude and phase deforma- 
tions 6 A and (5^* can always be expressed as sums of fre- 
quency powers, provided that SA and (5^ remain analytic 
at all frequencies sampled during the inspiral: 



K 

(5^ = AoR(/)^afeu'^'= 

k=l 



K 

S^{f) = Y,l3ku' 

k = l 



(3) 



where (afc, Ofc, 6^) e M for all fc, and where we have 
included the prefactor in SA. We have neglected 
possible logarithmic terms for simplicity, but they can 
be included easily in the same fashion. This ppE model 
introduces AK new parameters in the waveform; the sim- 
plest version of this model would allow only a single ex- 
ponent: 



M = Aor(/)«u\ S'f{f)^/3u' 



(4) 



Indeed, it can be shown that such a parametric defor- 
mation is sufficiently general to model all known MG 
corrections to the waveform to leading PN order [2 [3] , 
provided that the two tensor polarizations are dominant, 
as in GR. Otherwise, a second term would be needed in 
the phase and amplitude [331 SI] ■ 

Furthermore, a convincing argument can be made that 
a and b should be restricted to a few discrete values [44] . 
Suppose that the adiabatic-inspiral waveform is derived 
from an energy-balance equation with modified binding 
energy and flux of the form 



E^Eov^l + {■■■) 



PN 



SEv" 



E ^ Env 
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l + {---)p^ + SEv' 



(5) 
(6) 



where v is the relative velocity of the binary components, 
ellipses stand for higher-order PN terms, and Eq, Eq, SE 
and SE are all constants that may depend on the source 
parameters and on the MG coupling constants. The ex- 
ponents k and m must be integers, otherwise E or E 
would not be analytic, and we would lose the guaran- 
tee that the equations have a unique solution of hyper- 
bolic character by the Picard-Lindelof theoren:j^ Fur- 
thermore, we must have k > —2 and m > —10, otherwise 
E and E would not reduce to the GR result in the weak- 
field limit. These constraints lead to the deformations 



h/3 



(7) 



^ Given the differential equation dy/dt = f{t,y{t)), with initial 
value y{to) = yo, a- unique solution exists for all t S (to — e,to + f) 
provided that / is Lipschitz continuous in y and continuous in t. 
A noninteger value of k and m would lead to a differential equa- 
tion with a non-Lipschitz continuous source term, with possible 
loss of uniqueness. 
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where (a, 6) G Z, with (a, 6) > (-10,-15) [H]. 

In this paper we concentrate on phasing cor- 
rections by setting a — and choosing b £ 
{-7, -6, -5, -4, -3, -2, -1, 1, 2}. Different values of b 
represent different types of MG effects: b — —7 cor- 
responds to the leading-PN-order correction in Brans- 
Dicke theory H UMUM US |lS]_or in Einstein-dilaton- 
Gauss-Bonnet gravity (THl Hlj; b = —3 to the leading- 
order term in a phenomenological massive graviton the- 
ory P [in [HI [HI [221 Eg US], & > -5 (but 7^ -4) to 
the modified-PN scheme of Refs. [1H7], and 6 = — 1 to 
dynamical Chern-Simons gravity [121 [51] [57] . Notice, in 
particular, that the modified-PN scheme is clearly a sub- 
case of the ppE scheme. We omit 6 = because the 
resulting correction would be degenerate with an arbi- 
trary constant in the phase. We do not consider b < —7 
because the values that we study provide enough infor- 
mation to observe a consistent trend as b becomes more 
negative. Moreover, for b < —7, binary pulsar observa- 
tions can do a better job at constraining modified gravity 
theories than G Ws observations [48] . We do not consider 
6 > 2, as this would correspond to terms of higher than 
3.5 PN order, which we do not account for in the \E^qj^. 

B. Quantifying the bias 

Let us assume that a GW detection is reported for a 
dataset s that contains the waveform 

/ifuii(4) = m4) + (5/1(4), (8) 

where On is the vector of parameters that describes the 
GW source and source-detector geometry, h is the ap- 
proximated waveform family used to filter the data, and 
Sh is the unmodeled correction to h. Following Cutler 
and Vallisneri [SD], we compute the theoretical error ^^th 
induced by matched-filtering with h instead of /ifuii- 

The theoretical error 66th is defined as the displace- 
ment 0bf ~ ^tr between the true parameters 0tr and the 
best-fit parameters ^bf that would maximize the like- 
lihood in the absence of noise. When 6h{9ti) is neg- 
ligibly small, 0M — Otr', as Sh{9tT) grows in magni- 
tude, 0bf is displaced further and further away along the 
parameter-space direction in which h{9h{) can reproduce 
hiOtr) + Sh{9tr) most closely. 

To leading order in Sh, S9th is given by [30] 

SOth = {F^tT^{hA^M)\Sh{9^f)), (9) 

where = dh/d9^ are partial derivatives of the wave- 
form with respect to source parameters, F^p = (/i,a|/i,^) 
is the Fisher matrix, here evaluated at ^bf i and 

(5il52)=4Re^^^l^d/, (10) 

is the noise- weighted signal inner product, with S'„(/) 
the one-sided power spectral density of detector noise 



(see, e.g., [35] )• The inner product defines a signal norm 
\h\ by way of = {h\h). 

In Eq. Q, the waveform correction Sh is projected 
onto the waveform derivatives, and the projection cosines 
are mapped into parameter errors by the inverse Fisher 
matrix F^^, thus taking into account possible parameter 
covariances. Note that the resulting S9th is independent 
of the detection SNR, since both F and {h,p{9h{)\Sh{9bf)) 
are quadratic in the waveform amplitude. 

Equation ^ is only accurate for small S9th — more pre- 
cisely, for perturbations small enough that h{9\^i—S9th) — 
h{9hi) — h^aS9"^. Cutler and Valhsneri [30^ discuss more 
sophisticated versions of Eq. (|9| that can be applied to 
larger perturbations, but in this paper we adopt the sim- 
pler Eq. (|9|, not least because the other ingredients in 
our formulation depend on Sh{9fc) being small. 



C. Detecting modified gravity 

Following Vallisneri [31] (see also [3]), we define a MG 
correction Sh to the signal h to be detectable when the 
odds ratio of the Bayesian evidences for the MG and 
pure-GR scenarios, used as a detection statistic, is large 
enough that the false-alarm probability of favoring the 
MG hypothesis when GR is in fact correct is suitably 
small. More precisely, we compute the odds ratio 

^ _ P(MG|s) _ P(MG) / p{s\e,X)p{9,X)A^9A"'X 
~ P(GR|s) ~ P{GR)^p{s\9)p{9)d^^9 ' 

(11) 

where P(MG) and P(GR) are the prior probabilities that 
MG and GR are correct, p{s\9,\) is the likelihood that 
the detector data s contains the MG waveform h{9) -\- 
Sh{9, A), p{s\9) is the likelihood that s contains the pure- 
GR waveform h{9), and p{9, A) ~ p{9)p{\) and p{9) are 
the prior probability densities for the source parameters 
9 and the MG parameters A. 

If the true signal is MG, using MG templates would 
improve the fit to the data and increase the maximum 
value attained by the MG likelihood relative to the GR 
likelihood. On the other hand, the evidence for the more 
complicated, higher-dimensional MG model is reduced 
by the smaller prior mass within the support of the 
likelihood — the mechanism by which Bayesian inference 
embodies Occam's principle of parsimony. As signals get 
stronger, the improvement in the likelihood grows expo- 
nentially with the (squared) detection SNR, and eventu- 
ally it overcomes the effect of the priors. 

Even if we fix the true signal, the odds ratio O remains 
a random variable, because it depends on the realization 
of detector noise, by way of the likelihoods. For suffi- 
ciently large detection SNR, it can be shown [31] that 
Eq. becomes remarkably simple: for the cases when 
the underlying signal is pure-GR or MG respectively, we 
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find that 



(12) 



In this equation: 



• The dependence on the noise realization enters ex- 
clusively through X, a normal random variable with 
zero mean and unit variance. Technically, x is given 
by the inner product of detector noise with the 
"MG-unique" component of the MG correction Sh 
(the component that is orthogonal to the deriva- 
tives of the signal with respect to the GR parame- 
ters). 

• The constant of proportionality is the same in both 
rows: it is a function of P(MG), P(GR), and of the 
estimation errors and prior-density widths for the 
MG parameters A. As we shall see below, the fact 
that this constant is the same under both hypothe- 
ses allows us to disregard it when we analyze the 
statistics of our detection scheme. 



• Last, SNRmg = SNR^/l - FF is (by definition) 
the effective MG-detection SNR, with FF the fitting 
factor between the pure-GR and MG waveforms. 



FF(6', A) = max 



{h{e')\h{e) + sh{e,x)) 



S' \h{e')\x\h{e) + 5h{e,X)\ 



(13) 



This FF describes the maximum fraction of detec- 
tion SNR that can be recovered when searching for 
MG signals using GR templates. The FF is itself 
independent of SNR, and in our application it is 
given by 



(fe(4 + (^^th)|fe(4) + ShjOt,, A)) 

\h0tr + 5^th)| X \h0tr) + 5h{K, A)| 



(14) 



or also by 1 - i|5/i(4, A)| V|/i(4)P [U.- These 
formulas are equivalent, but only to leading order 
in A. 



By solving both rows of Eq. ( 12 ) for x as a function of 



O, we can derive the cumulative probability distributions 
of Ogr E^nd Omg from the normal probability distribu- 
tion p{x) = /^/\/27r. For instance. 



P{OoK > O* 



/•OO 



Jx{0- 



p{x) dx : 



(15) 



this is exactly the false-alarm probability Pp in a de- 
cision scheme that declares an MG detection when the 
odds ratio O computed from the data is larger than the 
threshold O* . By contrast, the true-detection proba- 
bility (also known as the detection efficiency) is given 
by Pe = P{Omg > C*), which we compute along the 
lines of Eq. ([l5|) , and which is a function of SNRmg ■ In 



practice, we set O* by requiring a sufficiently small Pp. 
In this paper, we use Pp = 10"'*, which seems appro- 
priate for the tens of detections expected from second- 
generation ground-based detectors (but see also the dis- 
cussion in Sec. Ill of [31]). 

We can now replace the vague statement that began 
this section with a precise quantitative condition: we 
will say that a MG correction becomes detectable once 
SNRmg is large enough to yield an efficiency of 1/2. This 
is a conventional value, but once Pp reaches 1/2 it grows 
rapidly to 1 with increasing SNRmg- For Pp — 10~*, we 
get P p = 1/2 when SNRmg = 2.75. Because SNRmg = 
SNRVl - FF, the SNR at which MG effects are de- 



tectable is then simply SNR' 



detect 



= 2.75(1 - FF)-i/2^ 



We thus confirm our expectation that MG detectability 
improves for larger detection SNR or for larger MG cor- 
rections, which lead to smaller FF. 



D. Statistical error and stealth bias 

We have by now collected the tools to estimate the fun- 
damental error J^th due to MG corrections, and to deter- 
mine the detection SNR necessary to positively detect the 
presence of MG corrections of a certain magnitude. The 
last ingredient in our formulation is the statistical error 
in the determination of the source parameters 9, which 
we estimate as the (square-root) diagonal elements of the 
inverse Fisher matrix, computed at ^tr for GR waveforms: 



(16) 



(no summation is implied by the repeated index). This 
expression implies that S9"^^^ scales as 1/SNR. Indeed, 
to leading order in 1/SNR, the inverse Fisher matrix de- 
scribes the variance of the maximum-likelihood param- 
eter estimator across noise realizations, as well as the 
shape of the Bayesian posterior parameter distribution 
around its mode when priors can be neglected ^49j . 

For a fixed MG-correction magnitude (a fixed /3), the 
statistical error decreases with increasing detection SNR, 
but the systematic bias remains constant and eventually 
becomes the limiting factor for parameter-estimation ac- 
curacy. If this happens before MG can be positively 
detected, the observation may suffer from stealth bias. 
Thus, we will look for the detection SNR at which 
maxc (56'"ij/(56'"tat — li ^-rid compare it to the detection 
SNR at which SNRmg = SNR^l - FF ==2.75. We 
will denote these two limiting SNRs as SNR*'''"'' and 
gj^j^dotcct^ respectively. 

We should not forget that our formulation is only valid 
for sufficiently large SNRs. It is difficult in general to 
determine if higher-order contributions change an ana- 
lytical result without actually computing them, but we 
can at least check that the leading-order result is self- 
consistent. Indeed, we adopt Vallisneri's 2008 criterion 
[15] for the statistical errors predicted by the Fisher- 
matrix formalism, and we require that the error made 
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in approximating h{9ti + SOstat) — h{9ti) as h^aSO"^^^ be 
sufficiently small (0.1 in norm) on most (95%) of the 1-a 
error surface described by (F^^)"^. For the theoretical 
errors, we require that the FFs computed in the two ways 
described below Eq. (13 1 be consistent to 1%. 



III. ANALYSIS AND RESULTS 

We examine three representative binary GW sources 
for second-generation interferometric detectors such as 
Advanced LIGO 150]: neutron-star-neutron-star bina- 
ries with (1.4-1- 1.4)M0 component masses; neutron-star- 
black-hole binaries with (1.4 -I- 5)Mq masses; and black- 
hole-black-hole binaries with (5 -I- 10)Mq masses. 

We concentrate on the inspiral phase of coalescence, 
which we model as quadrupolar and adiabatically quasi- 
circular with 3.5PN-accurate phasing. We truncate the 
waves at the innermost stable circular orbit of a point- 
particle in a Schwarzschild background (assuming GR), 
and we neglect spin effects and PN amplitude corrections. 
The resulting waveforms are described by nine parame- 
ters: the two masses (or the chirp and reduced masses), 
the time and inspiral phase at coalescence, two sky- 
position angles, two angles that describe the binary incli- 
nation and GW polarization, and the luminosity distance 
(see |30j and |3] for a similar waveform prescription). 
We assume a simultaneous detection by three second- 
generation detectors with the LIGO Hanford, LIGO Liv- 
ingston, and Virgo geometries and relative delays [5T] . 
and with identical broadband-configuration power spec- 
tral densities, as given by Eq. (10) of Furthermore, 
we assume that GW-detector noise is Gaussian and sta- 
tionary, as required by the Cutler- Vallisneri [3D] and Val- 
lisneri [31! formalisms. 

For these systems, we consid er ppE phasing corrections 
as described in Sec. 



and SNR'^"'"''* as a function of the MG-correction mag- 
nitude /3 for a range of exponents b. For each mass com- 
bination, each b, and each /3, we randomly select 1,000 
configurations of the phase and angle parameters from 
the appropriate uniform distributions (e.g., sky positions 
are chosen randomly on the celestial sphere). The lumi- 
nosity distance is reabsorbed in the SNR scaling, while 
the time of coalescence has no effect on our computa- 
tion. For each configuration we compute SNR'^°*°'^* and 
SNR'"'*^, and we report their median values. The con- 
dition max„ S9"^/66"^^^ — 1 that yields the latter is al- 
most invariably satisfied first for the chirp-mass parame- 
ter. Statistical fiuctuations around the median turn out 
to be rather small (a few percent). 

The i56'stat consistency check is satisfied for detection 
SNRs ranging from 10 to 100, typically ~ 50, but our 
results for lower SNRs should be at least representative 
of trends. The (50th check is satisfied for a maximum /3 
that depends on b, and which sets the largest f3 that we 
investigate. By contrast, the smallest f3 that we study 
corresponds to min(SNR'''='°^*, SNR*''''") = 100, a rela- 



II A 



and we compare SNR 




NS+BH 




BH+BH fi 

FIG. 1. SNR'*°*°'=* (solid curves) and SNR'''='= (dashed 
curves) as a function of /3 for b = —7, —6, —5, —4, —3, —2, —1,1 
and 2 (left to right), for a NS-NS system with (mi,m2) = 
(1.4, 1.4) M0 (top), a NS-BH system with (mi,m2) = 
(1.4, 5) Mq (mid) and a BH-BH system (bottom) with 
(mi, 7712) = (5, 10) Mq. The symbols in the top plot are de- 
scribed in the main text. 



lively large detection SNR that would be achieved very 
rarely in volume-limited searches |52| . 

Figure [l] presents the main results of this paper, with 
the solid curves plotting SNR'^''*'''^* (once again, the detec- 
tion SNR above which MG can be detected positively), 
and the dashed curves plotting SNR'^''*'' (the SNR above 
which the largest ratio of fundamental error to statistical 
error reaches one). Both sets of curves are plotted as a 
function of the MG-correction magnitude (3; the curves in 
each set correspond to 6 = —7, —5, —4, —3, —2, —1, 1, and 
2, from left to right as labeled. The top, mid, and bottom 
panels report results for our (1.4 + 1.4)1/©, (1.4 + 5)M0, 
and (5 -I- 10)Mq systems, respectively. 

This figure reveals a few interesting features. First, 
for the same detection SNR, more massive systems re- 
quire larger /3 before MG can be detected. This must 
happen because the larger the mass, the fewer the num- 
ber of useful GW cycles in the detector's band, so the 
signals become relatively featureless, and higher FFs can 
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be obtained for the same /?. Of course, this feature would 
probably be affected if one included the merger and ring- 
down phases of coalescence. Second, all the curves are 
relatively steep, with typical detection SNRs mapping 
out a small range of (5 for each b. Indeed, our analyti- 
cal formulation predicts SNR'^'='°"* and SNR*'*''" propor- 
tional to f3~^ at leading order (this is because Sh cx /3, 

and both S9th and SNRmg c< Sh). Third, while we com- 
puted all our results for positive /3, our formulation is 
invariant with respect to a change in its sign: with ev- 
erything else held fixed, the change yields 6h — ?> —Sh 
and (5^th ^ -SOth, which leaves SNR^'="='=* and SNR*"'^" 
unchanged. Fourth, and most important, the SNR'^'^'' 
curves lie almost always to the left of the SNR'^'''"'^* 
curves — stealth bias (which corresponds to the area be- 
tween the solid and dashed line) is a generic feature for 
these waveforms, and becomes more prevalent for more 
negative b. 

Thus, if ground-based GW detectors observe a signal 
from these binaries with a certain detection SNR, and 
Nature happens to deviate from GR in the strong-field 
regime, three cases are possible (with somewhat blurry 
boundaries) depending on the value of /3. They are il- 
lustrated by the small symbols in the top panel of Fig. [T] 
and listed below: 

• Overt bias: significant fundamental bias, 
detectable MG deviation: SNR^'''"(/3) < 
gjv^j^dotcct^^^ < SNR (rightmost symbol, star). 
The systematic error induced by using GR tem- 
plates is larger than the statistical uncertainty, but 
we are able to infer that the signal is described by 
a non-GR theory. Thus, we can correct the funda- 
mental bias, as long as our MG model is sufficiently 
close to the true theory of gravity. 

• Stealth bias: significant fundamental bias, 
undetectable MG deviation: SNR'''''"(/3) < 
SNR < SNR^'="='=*(/3) (middle symbol, square). The 
systematic error is larger than the statistical un- 
certainty, but we do not have sufficient statistical 
evidence to determine that the signal is described 
by a MG theory. Thus, we cannot correct the fun- 
damental bias. Parameter-estimation results might 
have to be prefaced with the caveat that they may 
include a systematic error as large as the statistical 
uncertainty if GR is incorrect in the strong-field 
with j3 in this range. 

• Negligible bias: insignificant fundamental 
bias, undetectable MG deviation: SNR < 

gj^j^bias^^^ < SNR^°"='=*(^) (leftmost symbol, cir- 
cle). Although we cannot determine that the sig- 
nal is described by a MG theory (i.e., the signal 
is indistinguishable from the GR prediction), the 
systematic error is smaller than the statistical un- 
certainty, so the accuracy of parameter estimation 
is not affected. This is also trivially the case when 
GR is correct. 



By contrast, if we had found that in general SNR'^'*'''(/3) > 
SNR'^'''°'^*(/3), systematic error would only appear in ob- 
servations where the presence of MG corrections is ob- 
vious. Although our analysis is robust for the sources 
we examined, we cannot extend our conclusions to dif- 
ferent sources, different waveform families, or different 
detectors. We leave this to future work. 

Of course, overt and (particularly) stealth bias are 
only a concern if they are not already excluded a pri- 
ori by previously obtained constraints on /3 for a given b. 
For instance, binary-pulsar observations |48| provide very 
strong constraints for b < —6 (see Table but deterio- 
rate rapidly for corrections corresponding to higher-order 
PN terms 3 . For this reason, it would be very interest- 
ing to map all current binary-pulsar and Solar- System 
constraints to the ppE framework. 



IV. CONCLUSIONS 

In this paper we introduced a scheme to determine if 
the systematic errors induced by the a priori assumption 
that GR is always correct could become as significant 
as noise-induced statistical uncertainties before a MG ef- 
fect could be detected confidently using Bayesian model 
comparison. We find that indeed such stealth biases ap- 
pear generically in analysis of inspiral GWs from stellar- 
mass compact binaries, as observed by second-generation 
ground-based detectors, at least for certain ranges of de- 
tection SNRs and MG correction strengths. 

The possibility of stealth bias should not be regarded 
as casting a veil of uncertainty on the interpretation of 
GW detections. Rather, it is a warning that inferences 
from GW observations must be considered in the con- 
text of all the other evidence for GR as the true theory 
of gravitation, as well as all other constraints on MG 
theories. 

For simplicity, we studied non-eccentric, non- 
precessing systems with no PN amplitude corrections to 
the waveforms, concentrating only on second-generation 
ground-based detectors. We employed a simple analytical 
formulation that is valid for sufficiently large detection 
SNRs. Obvious extensions of this work could encompass 
more complete waveform models (perhaps including also 
the merger and ringdown phases of binary coalescence, 
using the hybrid ppE models); different source popula- 
tions; low-SNR detections, which can be tackled with 
Cutler and Vallisneri's "ODE" approach to evaluate fun- 
damental bias [30] and with Monte Carlo integration to 
evaluate the Bayesian odds ratio [3^; space-borne detec- 
tors or third-generation ground-based detectors. Indeed, 
we have no doubt that the full power of these techniques 
will be unleashed once actual detections are made, to look 
for hints of MG theories and put GR on an ever-firmer 
footing. 
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b 


-7 


-6 


-5 


-4 


-3 


-2 


~1 


1 


2 


upper limit on /3 


4 X 10"^° 


4 X 10"^ 


3 X 10-* 


3 X 10"^ 


3 X 10^ 


3 X 10^ 


5 X 10* 


4 X 10" 


2 X lO^'^ 



TABLE I. Binary pulsar constraints on f} for different values of 6 |48| . 
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